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(57) Abstract 

A method is described for identifying a nucleotide at a de- 
fined point on a nucleic acid sequence. An oligonucleotide probe is 
annealed to a target nucleotide sequence of the nucleic acid sample 
at a point immediately adjacent and 3' to the nucleotide of interest. 
The probe is then extended in the direction of the nucleotide of in- 
terest in a reaction medium containing at least one chain terminat- 
ing nucleotide triphosphate (ATP, GTP, TTP and CTP). The nucle- 
otide of interest is complementary to the labeled nucleotide incor- 
porated into the primer by the extension reaction. 
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TITLE 

METHOD OF IDENTIFYING A NUCLEOTIDE PRESENT 
AT A DEFINED POSITION IN A NUCLEIC ACID 
FIELD OF THE INVENTION 
This invention relates to a rapid, convenient 
process to identify a nucleotide present at a specific 
position in a nucleic acid chain (DNA or RNA) of a 
biological sample. 

BftCKGROUNP OF THE INVENTION 
The nucleic acid content of any organism is the 
essence of that organism, and differences in the nuclei 
acid are known to be of primary importance in 
distinguishing one from another. The science of 
genetics is based on the identification and 
characterization of differences in nucleic acid 
sequence. These differences, or polymorphisms, are 
often termed "mutations" and may be due to nucleotide 
substitution, insertion or deletion. Thus, many 
techniques have been developed to compare homologous 
segments of DNA or RNA to determine if the segments are 
identical or if they differ at one or more nucleotides. 
Identification of genetic polymorphisms is useful for 
genetic diagnoses in medicine, identification of 
individuals in forensic science, identification of 
pathogenic organisms, construction of genetic 
polymorphism maps for locating genes important in 
disease and in agriculture and for breeding of plants 
and animals . 

The most definitive method for comparing DNA 
segments is to determine the complete nucleotide 
sequence of each segment* Examples of how sequencing 
has been used to study mutations in human genes are 
included in the publications of Engelke, et • al., Proc. 
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Natl* Acad. Sci. U.S.A. 85:544-548 (1988) and Wong, et 
al., Nature 330:384-386 (1987). At the present time, it 
is not practical to use extensive sequencing to compare 
more than just a few DNA segments, because the effort 
5 required to determine, interpret, and compare sequence 
information is time-consuming and costly. Development 
of automated sequencing technology; Innis et al., 
WO 9003442; Prober et al., EPO' 252638; Rosenthal et al., 
J. DNA Sequencing and Mapping 1:63-71 (1990); Bauer, 

10 Nucleic Acids Res. 18:879-884 (1990); has made the 

process more efficient and has substituted fluorescent 
reporters in place of radioactivity; however, these 
methods involve a time-consuming separation of reaction 
products using a* polyacrylamide gel, and the nucleotide 

15 sequence information at any one point in a nucleotide 
chain is one of several hundred such pieces of 
information. That is, if one is only interested in 
determining the nucleotide at a specific point in the 
polynucleotide chain (e.g. a position known from 

20 previous genetic analysis to be involved in a disease 
phenotype) , that information must be retrieved from a 
larger volume of data. 

For genetic mapping purposes, the most commonly used 
screen for DNA polymorphisms consists of digesting DNA 

25 with restriction endonucleases and analyzing the 

resulting fragments by means of Southern blots, a method 
known as "Restriction Fragment Length Polymorphism" or 
RFLP mapping, as described by Botstein, et al., Am. J. 
Hum. Genet. 32:314-331 (1980); White, et al., Sci. Am. 

30 258:40-48 (1988). Mutations that affect the recognition 
sequence of the endonuclease will preclude enzymatic 
cleavage at that site, thereby altering the cleavage 
pattern of that DNA. DNAs are compared by looking for 
differences in restriction fragment lengths. RFLP 
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detection in a genomic DNA sample is very labor 
intensive for it requires preliminary steps of genomic 
DNA isolation, restriction, gel electrophoresis and 
Southern transfer steps, before hybridization to a probe 
5 that is generally radioactively labeled for sensitive 
detection of homologous sequences. A major problem 
associated with RELP detection is the necessity of the 
polymorphism to affect cleavage with a restriction 
endonuclease, therefore many mutations cannot be 
10 detected with this method (Jeffreys, Cell 18:1-18, 
1979) . More importantly, although RFLP and several 
other methods in the prior art (e.g. Wallace et al . , 
Nucl. Acids Res. 9:879-894, 1981 or Saiki, et al . , U.S. 
Pat. No. 4,683,194 or Kornher et al., U.S. Pat. No, 
15 4,879,214) are useful for finding polymorphisms in DNA, 
they do not elucidate the exact nature of the nucleotide 
present at a specific position on the nucleic acid 
sequence. In some applications, such as prenatal 
diagnosis, knowledge of which nucleotide is present at a 
20 given position is extremely important, since some 

nucleotide changes do not alter the coding capacity of a 
gene and are therefore "silent" with respect to 
phenotype. Those techniques that elucidate the nature 
of the nucleotide present are discussed below. 
25 Many techniques designed to elucidate the nature of 

a nucleic acid polymorphism involve hybridization with a 
polynucleotide probe, a portion of which is 
complementary to the nucleotide position (s) of interest. 
A target sequence that is perfectly complementary to the 
30 probe can be distinguished from a target that differs by 
as little as a single nucleotide in a variety of ways. 
A technique involving amplification and mismatch 
detection (AMD), described by Montandon et al., Nucl. 
Acids Res. 17:3347-3358, 1989, utilizes amplification of 
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the DNA region of interest from two samples , followed by 
denaturation and reannealing to form homo- and 
heteroduplexes between DNA molecules of the two samples. 
Any nucleotide position that is different between the 
5 two amplified DNAs (i.e. nucleotide mismatch in the 
heteroduplex) can be identified with the use of 
hydroxylamine and osmium tetroxide to modify mispaired 
cytosines and thymines , respectively, followed by 
piperidine-catalysed cleavage of the modified 

10 heteroduplexes, and subsequent gel electrophoresis to 

identify cleavage products. Although this technique and 
the analogous technique of EP0329311 to Campbell and 
Cotton, are useful for both detecting and identifying 
all point mutations within a nucleic acid segment, they 

15 share some of. the same, serious disadvantages of the 

chemical degradation method for DNA sequencing by Maxam 
and Gilbert (Proc. Natl. Acad. Sci. 74:560, 1977). 
These techniques require dangerous chemicals that modify 
and cleave nucleic acids, they involve several different 

20 chemical reactions, and require a time-consuming gel 
assay. 

Several techniques that utilize an oligonucleotide 
probe designed to overlap the position of interest at 
the 3'prime terminus are reviewed below. The technique 

25 described in Landegren, et al.. Science 241:1077-1080 
(1S88) uses an enzymatic detection of polymorphisms. 
For this method, oligonucleotide probes are constructed 
in pairs such that their junction corresponds to the 
specific nucleotide site which is of interest. These 

30 oligonucleotides are then hybridized to the DNA being 
analyzed. Base pair mismatch between either 
oligonucleotide and the target DNA at the junction 
location prevents the efficient joining of the two 
oligonucleotide probes by DNA ligase. The Ligation 
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Amplification Reaction (LAR) as reported by Wu and 
Wallace, Genomics 4:560-569, (1989) and Wallace and 
Skolnick (WO 89/10414) is also dependent upon ligation 
of oligonucleotides whose 3-prime ends include the 
5 nucleotide position of interest. They demonstrate that 
four pairs of oligonucleotides that are complementary to 
the upper and lower strand of the target DNA will be 
exponentially amplified only if there is perfect 
complementarity between the oligonucleotides and the 
10 target DNA. The patent of Vary et al . , U.S. Pat, No. 

4,851,331, also depends upon an enzymatic reaction that 
requires one end of the oligonucleotide probe to form a 
perfect, complementary matched basepair with the target 
nucleotide sequence. As in the examples above, an 
15 oligonucleotide probe is designed such that the 3-prime 
end of the complementary probe includes the specific 
nucleotide position of interest. After annealing this 
oligonucleotide probe to the template DNA, a polymerase 
that replicates nucleic acid strands in a template 
20 directed fashion is used to incorporate modified 

nucleotides into a newly synthesized strand. If the 3- 
prime end of the oligonucleotide probe did not contain a 
nucleotide complementary to the target nucleotide 
sequence, then the polymerase cannot begin the 
25 replication process . The amount of incorporation is a 
measure of the amount of the specific template DNA in 
the biological sample. This same principle of utilizing 
a polymerase to discriminate whether there is a 
mismatched base at the 3-prime end of the primer was 
30 also recently combined with the PCR to give an 

exponential rather than linear increase of the reaction 
products in a process called Allele-specif ic Polymerase 
Chain Reaction (ASPCR) (Wu et al., Proc. Natl. Acad. 
Sci. 86:2757-2760, 1989). In their example, the 
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reaction products were run on an agarose gel and 
detected by ethidium bromide staining. However, 
fluorescently-labeled oligonucleotide primers may also 
be used for detection in ASPCR (e.g. Chehab and Kan, 
5 Proc. Natl. Acad. Sci. 86:9178-9182, 1989). The patent 
of Caskey and Gibbs (EP0333465A2) for a process 
involving Competitive Oligonucleotide Priming (COP) is 
essentially the same. In COP, two differentially 
labeled oligonucleotide primers that differ at the 3 f 

10 nucleotide and overlap the position of interest are 

present in the same, rather than in separate reactions, 
and thus compete for template molecules in the 
hybridization reaction . 

A general problem shared by the various techniques 

15 mentioned in the preceding paragraph is that the 
difference in duplex stability of a perfectly vs 
nonperfectly matched oligonucleotide to its target DNA 
is dependent upon the length and sequence of the 
oligonucleotide. Therefore, regardless of the assay 

20 method, a different set of emperically determined 

experimental reaction conditions may be required in 
order to assay different genomic loci for a 
polymorphism. Secondly, since their assay is a +/- 
assay (a reaction product should be formed or absent) , 

25 it is necessary to perform several assays on a single 
target DNA so that an inference can be made concerning 
the nature of the nucleotide at a given position. This 
is especially important since many organisms are 
diploid, polyploid, or have several copies of a given 

30 sequence such that an individual may be polymorphic at 
any one nucleotide position (e.g. heterozygous) . 
Several reactions utilizing oligonucleotide probes 
differing in sequence only at the 3-prime end may give a 
reaction product in such a situation. Finally, this 
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same result can occur if the assay method is too 
sensitive such that even inefficient ligation or 
replication is detected as a positive signal. 
Misincorporation of nucleotide substrate, well 
5 documented in the literature for polymerases; Ricchetti 
and Buc, The EMBO J. 9:1583-1593, (1990); or template- 
independent ligation products due to blunt end ligation; 
Hayashi et al., Nucl. Acids Res. 14:7617-7631, (1986); 
can lead to a false signal if not adequately suppressed 
10 in the reaction. Such misincorporation is especially 
apparent when the correct, complementary nucleotide 
substrate is absent from the reaction. The polymerase 
chain reaction is quite dependent upon products 
generated during the first few rounds of amplification. 
15 • The difficulty of devising conditions that totally 

suppress amplification by the primer that contains a 
mismatched base at the 3 1 end is also documented in 
Chehab and Kan; Proc. Natl. Acad. Sci . 86:9181 (1989); 
where fluorescence values as high as 0.8 were considered 
20 negative for they were less than 1.0, while all values 
above 1.0 were considered positive for amplif ication 
(even values as low as 1.4 were considered positive). 

In the technique described in Mundy, U.S. Pat. No. 
4,656,127, specific mutations can be detected by first 
25 hybridizing a labeled DNA probe to the target nucleic 
acid in order to form a hybrid in which the 3 f end of 
the probe is positioned adjacent to the specific base 
being analyzed. Then, a DNA polymerase is used to add a 
nucleotide analog, such as a thionucleotide, to the 
30 probe strand, but only if the analog is complementary to 
the specific base being analyzed. Finally, the probe- 
target hybrid is treated with exonuclease III. If the 
nucleotide analog has been incorporated, the labeled 
probe is protected from nuclease digestion. Absence of 
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a labeled probe indicates that the analog and the 
specific base being analyzed were not complementary. 
For this technique , the nucleotide analog may be given 
as the sole substrate for incorporation, or it may be 
5 added as one of a maximum of three nucleotide 

substrates. It is critical to note that all four bases 
cannot be given as substrate, since this would allow 
chain elongation to occur until eventually the modified 
nucleotide analog would be complementary and thus 
10 incorporated. This limits the utility of the assay to 
certain DNA sequences that can have the appropriate 
positioning of primers . For it is well known that if 
the correct base is not supplied in a reaction that a 
"wobble" base pair between G and T will occur to a 
15 significant degree (Boosalis et al., J. of Biol. Chem. 
262:14689-14 696, 1987) . Unlike the reaction described 
in Mundy, the reaction of the present invention can 
include all four nucleotide substrates, thus leading to 
a lower incorporation of non-complementary nucleotide. 
20 A similar technique described by Sokolov; Nucl . 

Acids. Res. 18:3671, (1989); utilizes an unlabeled 
oligonucleotide probe that is perfectly complementary to 
the target sequence and again positioned with the 3- 
prime end of the probe adjacent to the nucleotide 
25 position of interest. Four separate reactions are 

performed, each with only one of the four radioactively 
labeled dNTP's (dATP, dGTP, dCTP, dTTP) supplied as 
substrate for a primer extension reaction using Taq- 
polymerase. Reaction products are run on a standard 
30 polyacrylamide sequencing gel, and the level of 

nucleotide incorporation in each of the four reactions 
is monitored by autoradiography. In this manner, the 
specific nucleotide at the position of interest is 
indicated. 
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The present invention solves several problems 
inherent in the Sokolov method. (1) This invention is 
not dependent upon radioactive substrates nor the time- 
consuming monitoring of the assay via a polyacrylamide 
5 gel and subsequent autoradiography. (2) This invention 
uses chain-terminating nucleotides as substrates in the 
reaction f therefore preventing incorporation of several 
of the same nucleotide in the primer extension product 
if there are several of the same nucleotides present in 
10 a row on the template. (3) The analysis of Sokolov 
required four separate reactions whereas the present 
invention would need only one reaction to gain the same 
amount of information. (4) As mentioned above, if the 
correct, complementary nucleotide substrate is not 
15 present in the reaction, then significant 

misincorporation can occur in the Sokolov reaction. 
Misincorporation is substantially prevented in the 
present invention. 

Automation of the ASPCR reaction was described but 
20 not demonstrated in Wu et al., Proc. Natl. Acad. Sci. 

86:2757-2760, (1989), and again by Chehab and Kan, Proc. 
Natl. Acad. Sci. 86:9178-9182, (1989), for fluorescent 
ASPCR. In both cases, each ASPCR reaction is performed 
using one biotin-labeled primer and one f luorescently- 
25 labeled primer. The biotinylated, double-stranded 
amplif ication products are then separated from 
unincorporated fluorescent primer using streptavidin 
coated magnetic beads. The color of the amplified DNA 
would then be determined f luorometricly through a fiber 
30 optic bundle, or alternatively, by separation and 

detection on a sequencing gel as is currently performed 
for DNA sequencing using f luorescently labeled primers. 
The differences between this method and that of the 
present invention are significant. Most importantly, 



WO 92/16657 



PCT/US92/01691 



10 



the reaction products due to amplification from the 
"wrong" primer are also biotinylated and will be 
retained in the capture process. Furthermore, these 
products due to mis incorporation will electrophorese to 
5 the same position in the sequencing gel, thus 

interfering with the analysis. Finally, the size of the 
amplified products will in general be larger than an 
oligonucleotide, thus requiring a longer time for gel 
electrophoresis and detection than that of the present 
10 invention. 

In order to be useful for a wide variety of 
applications, a technique that identifies a nucleotide 
at a given position in a nucleic acid sample should be 
fast, simple, reliable, and avoid radioactive or nucleic 
15 acid cleaving compounds. The currently available 

detection techniques discussed above are deficient in 
one or more of these areas. All of these problems are 
overcome by the present invention. 

The process of the present invention exploits some 
20 of the same principles and advantages as described for 
sequencing of nucleic acids using fluorescent 
dideoxynucleotide substrates (Prober et al., EPO 252638; 
Mitchell and Merril WO 89/12063; Innis et al., 
WO 9003442) . It is, however, different from that 
25 process in that one of the components of the process 

namely the dNTP substrates allowing multiple lengths of 
primer elongation product is missing in this invention. 
The method of the present invention generally requires a 
knowledge of the nucleic acid sequence in the region of 
30 interest, but it does not require the mutation to be at 
a restriction enzyme cleavage site. The method is 
capable of giving unambiguous results. In the preferred 
forms, it does not require tedious preliminary steps 
that characterize prior methods and indeed lends itself 
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to automation due to the lack of centrifugation steps 
and the ability to quickly assay reaction products 
without a gel separation step* If a gel separation 
assay is used, then multiplexing of samples based on 
5 differences in length of the probe oligonucleotide is 
possible. Alternatively, the probe oligonucleotide of 
several reactions can be of the same length, but loaded 
at different times after pausing the electrophoresis 
run* 

10 SUMMARY OF THE INVENTION 

The present invention provides a process for 
identifying the nucleotide present at a specific 
position in a* nucleic acid sequence. It is based upon 
the selective attachment of one of four chain- 

15 terminating nucleotides, that are detectably labeled and 
distinguishable, onto a probe in a complementary, 
template dependent fashion. The probe is designed to 
selectively hybridize to a target nucleotide sequence 
and oriented such that a one nucleotide extension of the 

20 probe, usually in the 3-prime direction, will base pair 
to the nucleotide position of interest- The 
oligonucleotide probe, the nucleic acid containing the 
target nucleotide sequence, or both, may contain a site 
for specific immobilization to facilitate separation 

25 from unincorporated nucleotides and primers, such that 
the labeled nucleotides incorporated into the reaction 
product can be measured without use of a gel system such 
as agarose or acrylamide. 

Thus the present invention provides a method for the 

30 identification of the nucleotide present at a single, 

defined position in the nucleic acid which comprises the 
following steps : 

<a) contacting a nucleic acid analyte with a probe 
oligonucleotide of sufficient length and appropriate 
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sequence under conditions sufficient for the probe to 
bind preferentially to a target nucleotide sequence and 
form a hybrid having a double-stranded portion including 
the 3-prime end of the probe. The nucleotide position 
5 of interest is the first base of the nucleic acid 

analyte which extends in a 3 1 to 5 ■ direction beyond the 
3' end of the probe nucleotide sequence and is 
immediately adjacent to the hybrid formed (Figure 1) . 

(b) extending the probe oligonucleotide strand of 
10 the hybrid beyond its 3 f end in the 5 1 to 3' direction 

by enzymatic addition of a detectably labeled, chain- 
terminating nucleotide which is complementary to the 
nucleotide position of interest; 

(c) detecting whether a detectably-labeled chain- 
15 terminating nucleotide has been incorporated to 

determine which of the complementary nucleotide base(s) 
is present at the target nucleotide position of 
interest . 

(d) identifying the nucleotide of interest as the 
20 nucleotide complementary to the incorporated chain- 
terminating nucleotide . 

The present invention also provides a kit. The kit 
includes reagents in packaged form. The package may 
include extension primers, a probe polynucleotide, 

25 chain-terminating nucleotides and extension enzymes. 
The package may include attachment moieties and solid 
supports. Any of the reagents may include attachment 
moieties. A package may include any combination of 
reagents as necessary for a particular purpose. A 

30 package may include an insert such as a standard or 
direction for handling specific reagents. 
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DESCRIPTION OF FIGURES 
Figure 1, comprising Figures la-lh, illustrate in 
various schematic forms, the location of various 
components of the process of this invention. 
5 Figure la illustrates an analyte strand (An) which 

contains the nucleotide position of interest (N) , the 
identity of which is to be determined by the assay. A 
target nucleotide sequence (TNS) immediately 3* of, but 
not including the nucleotide position of interest is 

10 illustrated. A double strand nucleic acid region forms 
when a probe binds to analyte strand An by complementary 
base pairing to the target nucleotide sequence TNS. 

Figure lb illustrates the incorporation of a chain 
terminating nucleotide (N*) complementary to the 

15 nucleotide of interest (N) after contacting the double 
stranded region in Figure la with a polymerase capable 
of primer extension. (The * in this and subsequent 
figures is used to illustrate a detectable label 
attached to the nucleotide) . 

20 Figure lc illustrates the same features as Figure 

la, but with a specific example showing the nucleotide 
of interest as a thymidine (T) . 

Figure Id illustrates the same features as Figure 
lb, but using the same specific example as Figure lc, to 

25 show the result of enzymatic incorporation of a 

detectably labeled adenosine at the 3' terminus of the 
probe as the nucleotide complementary to the nucleotide 
of interest (thymidine). 

Figure le illustrates the incorporation of a 

30 detectably labeled guanosine at the 3 1 terminus of the 
probe and complementary to the nucleotide of interest 
(cytidine) . 

Figure If illustrates the use of another analyte 
strand for the assay (the complementary strand of the 
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analyte strand shown in Figure 1c) . As in prior 
figures r the target nucleotide sequence is chosen to be 
immediately 3 f of the nucleotide of interest (in this 
example shown as an adenosine) r and the probe is 
5 complementary to the target nucleotide sequence. 

Figure Ig illustrates the incorporation of a 
detectably labeled thymidine at the 3' terminus of the 
probe and complementary to the nucleotide of interest 
(adenosine) . 

10 Figure lh illustrates the incorporation of a 

detectably labeled cytidine at the 3' terminus of the 
probe and complementary to the nucleotide of interest 
(guanosine) . 

Figure 2 illustrates an example of steps that can be 
15 used in the practice of this invention when it is 

desired to use a nucleic- acid strand immobilized on a 
solid support as the analyte strand. 

Figure 3 illustrates an example of steps that can be 
used in the practice of this invention when it is 
20 desired to use a nucleic acid strand in solution as the 
analyte strand. 

Figure 4 illustrates identification of the nucletide 
of interest (N) when it is located at the 5' terminus of 
the analyte strand. 
25 Figure 5 illustrates identification of a difference 

between two analyte strands, when that difference is 
part of a nucleotide insertion or deletion. 

Figure 6 illustrates how the number of assays must 
increase if the number of distinguishably labeled chain 
30 terminating nucleotides in the reaction are decreased, 
if the identity of all possible nucleotides at the 
position of interest is to be determined. 

Figure 7 illustrates the output signals obtained 
when the detectably labeled substrates used in the 
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examples are detected with the Genesis 2000 DNA analysis 
system. 

Figure 7a illustrates the output signal [ratio of 
the green line to red line peak height, +/- one standard 
deviation] of data obtained as in Figure 7b and 7c for 
each of the four detectably labeled nucleotides used in 
the examples detected either through a gel or through a 
capillary. 

Figure 7b is representative data showing the 
position and relative peak heights of the two 
photomultiplier tube signals (red and green lines) when 
SF-ddGTP-505 or SF-ddTTP-52 6 are electrophoresed through 
a urea-polyacrylamide slab gel mounted on the Genesis 
2000. 

Figure 7c illustrates the output signal obtained 
when SF-ddGTP-505 or SF-ddCTP-519 are each passed four 
times (therefore four peaks) through an empty capillary 
mounted for detection on the Genesis 2000 unit. 

Figure 8 illustrates the double stranded portion of 
the mouse RNA polymerase II gene that was amplified 
using PCR primer 1 and PCR primer 2, as well as the 
position and sequence of the various oligonucleotide 
probes used in Examples 1-5. 

Figure 9 illustrates the sequence of the Wildtype 
and the Mutant allele of the RNA polymerase II gene 
between nucleotides 5395 and 5454, with the difference 
between the two alleles indicated by boldface type at 
position 5430. 

Figure 10 illustrates the data obtained in Example 
1: Incorporation of either labeled SF-ddATP-512, SF- 
ddGTP-505, or both in approximately equal amounts, whetn 
probe A is used on nucleic acid samples known to be 
either Wildtype, Mutant, or Heterozygous at nucleotide 
position 5430 of the RNA polymerase II gene. 



WO 92/16657 



PCT/US92/01691 



Figure 11 illustrates the data obtained in Example 
2: Incorporation of either labeled SF-ddTTP-526 or 
SF-ddCTP-519 when probe B is used on nucleic acid 
samples known to be either Wildtype or Mutant at 
5 nucleotide position 5430 of the RNA polymerase II gene. 

Figure 12 illustrates the data obtained in Example 
3: The level of misincorporation that may occur if the 
correct , complementary nucleotide is not included in the 
reaction . 

10 Figure 13 illustrates the data obtained in Example 

4: Correct incorporation of SF-ddTTP-52 6 in the 
Wildtype allele and SF-ddCTP-519 in the Mutant allele in 
the presence of all four, distinguishably labeled ddNTPs 
in the reaction at lower nucleotide concentrations than 

15 in previous examples. 

Figure 14 illustrates the double stranded nucleic 
acid region of the Wildtype Al gene of Z<gg mays that is 
amplified using PCR primers A and B, with primer C 
indicating the sequence and position of the 

20 oligonucleotide probe used in Examples 5-6. 

Figure 15 illustrates the data of Example 5: 
Correct incorporation of SF-ddGTP-505 when a mutant 
allele, a-dt . of the maize Al gene (carrying a G-C base 
pair rather than C-G base pair) is assayed. 

25 Figure 16 illustrates the data of Example 6: Use of 

a modified T7 polymerase (Sequenase) , to give the 
correct incorporation of approximately equal amounts of 
SF-ddCTP and SF-ddGTP in a sample heterozygous for the 
maize allele. 

30 DETAILED DESCRIPTI ON OF THE INVENTION 

The present invention discloses a process for 
identifying the nucleotide present at defined nucleotide 
position in a nucleic acid sequence. This process has 
utility as a rapid, convenient means to genotype a 
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biological sample with respect to specific, nucleic acid 
sequence information (e.g. nucleotide positions 
correlated with phenotypic differences among individuals 
between species or in tissues) . Single base pair 
5 mutations such as transitions, transversions, insertion, 
deletion as well as more complex rearrangements can be 
assayed using the method of the present invention if the 
appropriate oligonucleotide probe is designed (Figure 
5) . 

10 The presence of a target nucleic acid in a 

biological sample may be detected generally as the 
presence or absence of an incorporated nucleotide. 
Individual- nucleotides located at selected sites in the 
nucleic acid sample may also be identified. The method 
15 presented here is generally applicable to all nucleic 

acid sequences (DNA or RNA) f whether they aire single or 
double stranded, as long as the target nucleic acid 
strand is of sufficient length to form a hybrid with a 
complementary, oligonucleotide probe. Any source of 
20 nucleic acid, in purified or nonpurified form can be 
utilized as the starting nucleic acid or acids, if it 
contains, or is suspected of containing, the target 
nucleic acid sequence. The target nucleic acid can be 
only a fraction of a larger molecule or can be present 
25 initially as a discrete molecule. Additionally, the 
target nucleic acid may constitute the entire nucleic 
acid or may be a fraction of a complex mixture of 
nucleic acids . 

The method of this invention requires formation of a 
30 hybrid between an oligonucleotide primer preferred to 
herein as the oligonucleotide probe) and the target 
nucleic acid sequence. Probes of relatively short length 
(e.g. 10-100 nucleotides) are preferred in that they can 
be chemically synthesized. The probe can consist of 
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DNA, UNA, a contiguous DNA-RNA polynucleotide r or a 
nucleic acid chain containing one or more modified 
nucleotides. Under the appropriate conditions the probe 
should selectively form a hybrid with the target 
5 nucleotide sequence. The probe of this invention 
terminates one nucleotide prior to the position of 
interest such that the first nucleotide to be added to 
the 3 f terminus of the oligonucleotide probe in a 
template-dependent, primer extension reaction will be a 

10 nucleotide complementary to the nucleotide position of 
interest (Figure 1) . 

In practice, the specific target nucleic acid of a 
biological sample must be present in sufficient quantity 
such that hybrid molecules formed with the probe 

15 oligonucleotide are detectable by the label incorporated 
onto the probe. 

Some samples may contain a sufficient number of 
target nucleic acid strands, but other samples may not. 
For these, molecular cloning of a region surrounding the 

20 nucleotide position of interest would suffice as a means 
to increase the number of target molecules, but it is 
tedious and time-consuming. For a description of 
methods to clone nucleic acid fragments see Sambrook, J . 
et al., 1989, Molecular Cloninc r; A Laboratory Manual. A 

25 linear amplification of the target nucleic acid sequence 
using multiple rounds of primer extension; Proudfoot 
et al., Science 209:1329-1336, 1980; or the exponential 
amplification as described in detail in Mullis, et al., 
U.S. Pat. No. 4,683,195 and Mullis, U.S. Pat. No. 

30 4,683,202 may also be utilized. It is necessary to 
either remove or inactivate the unincorporated 
nucleotides and primers of the amplification reaction. 
Non-labeled, unincorporated nucleotides will allow 
primer extension to occur beyond the nucleotide of 
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interest. Similarly, amplification primers that are 
free in solution can be extended and provide 
incorporation at positions other than the position of 
interest. Various methods obvious to those skilled in 
5 the art of molecular biology are available for removing 
unincorporated nucleotides and primers. However, since 
we desire a method that is rapid and automatable, the 
preferred form of separation is one utilizing attachment 
of the amplified nucleic acid product to a solid support 
10 with subsequent washing steps* An avidin-biotin system 
is preferred. 

The template may be RNA or PNA, and may be double or 
single stranded. If double stranded, it is necessary to 
denature the strands to allow hybridization between the 

15 template strand and the oligonucleotide probe. Methods 
for this denaturation and subsequent hybridization step 
are well known to those skilled in the art of 
sequencing. However, since it is well known that 
formation of the hybrid between the oligonucleotide 

20 probe and the nucleic acid strand containing the target 
nucleic acid sequence can be inhibited by the 
complementary, non-template strand, the preferred method 
is to physically separate the template and the non- 
template strand after a denaturation step. The template 

25 strand can either be the strand present on the solid 

support (Figure 2) , or a strand that is free in solution 
(Figure 3) . 

By design of an appropriate probe and utilizing the 
appropriate nucleic acid strand as template, essentially 
30 all nucleotide positions, even those at the end of a 

linear nucleic acid molecule can be assayed (Figure 4) . 
In practice, since amplification from a complex nucleic 
acid mixture will at times give several different 
amplif ication products, the preferred method is to 
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utilize an oligonucleotide probe that is different from 
the primer (s) used in the amplification process. 

The probe that has formed a duplex (hybrid) within 
the template is then subjected to enzymatic primer 
5 extension with enzyme such as primer-dependent DNA 

Polymerases and Viral Reverse Transcriptases , including 
AMV Reverse Transcriptase, various eukaryotic primer- 
dependent DNA Polymerase and DNA Polymerase I from £. 
coli (Klenow fragment) . The basic elements required for 

10 execution of primer extension reactions are reviewed iri 
Mullis et al., U.S. Pat. No, 4,683,195 and Mullis, U.S. 
Pat. No. 4,683,202, and include definition of a primer, 
size of primers, preparation of oligonucleotide primers, 
methods for separating strands of double stranded 

15 .nucleic acid, preferable ratio of primer to template, 
conditions for mixing and annealing primer to template 
strand, and conditions for extending the primer in a 5 ' 
to 3* direction. 

The enzyme used in the primer extension reaction 

20 should not exhibit exonuclease activity upon the 

components of the reaction. For example, either the 
enzyme should be free of 3' to 5' exonuclease activity 
or the probe should be of such composition as to resist 
such a degradation activity. Examples of this patent 

25 were performed under the former condition. 

Adaptations of and alternatives to the primer 
extension technique can also be used with the process of 
the present invention. Double stranded nucleic acid 
targets can be used to generate both the template and 

30 primer strands, thereby eliminating the primer-template 
annealing step. By enzymatic or chemical treatment of 
the double stranded nucleic acid, molecules can be 
produced that have a recessed 3* strand and an 
overhanging 5 1 strand and thus are substrates for 
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nucleotide addition by a DNA polymerase. For example, 
cleavage of DNA with many restriction enzymes generates 
5 f overhangs that are substrates for DNA polymerases. 
Also, there are 3* exonucleases that remove 3' 
5 nucleotides from double-stranded DNA, producing 

molecules with 3' recessed strands and 5' overhanging 
strands. 

The hybridizing and extending steps can be performed 
in solution or in solid phase reactions . The detection 

10 can also be in solution, after attachment to a solid 

phase, or after passing through a gel such as acrylamide 
or agarose. However, as previously mentioned, the first 
two methods are preferred for they avoid the time- 
consuming gel assay. Without a gel assay, it is 

15 necessary to separate the unincorporated labeled chain- 
terminators after the elongation step. Note that in the 
present invention, it is not necessary to wash away the 
excess oligonucleotide probe that did not hybridize, 
since the unextended probe does not contain a labeil. 

20 There are four general forms of such separation: (1) 

immobilizing the elongated probe or hybrid selectively 
(e.g. by attaching to a binding segment on the analyte 
strand or on the probe) and separating away 
unincorporated, labeled nucleotide substrate together 

25 with sample polynucleotides that probe did not bind to; 

(2) immobilizing the elongated probe or hybrid non- 
selectively with other polynucleotides and separating 
away the unincorporated, labeled nucleotide substrate; 

(3) separating the unincorporated, labeled nucleotide 
30 substrate without immobilizing the elongated probe or 

hybrid, and (4) inactivating the label associated with 
unincorporated nucleotide substrate such that it is no 
longer detectable by the assay method employed. 



WO 92/16657 



PCT/US92/01691 



22 

Form (1) is the preferred method for it offers 
improved specificity and signal concentration in that a 
binding group can be captured specifically by a solid 
phase material. For example, a pendant biotin or 
5 biotin-dUTP incorporated into the probe can be 

specifically captured by avidin-coated materials such as 
avidin-agarose, avidin-coated magnetic beads, or avidin- 
coated microtiter wells. Another example might be the 
use of an oligonucleotide probe with a 5' -extension that 
10 is nonhomologous to the target sequence. This portion 
of the probe can then be used to capture the elongated 
probe (or hybrid) to a solid support that contains the 
complementary sequence. In such case, elongated probe 
or hybrid can be captured specifically and in high 
15 concentration on the solid phase, with the major other 
material captured (unhybridized probe) not causing non- 
specific signal. 

If the capture onto solid support is due to a 
binding system present on the probe, then it should be 
20 apparent that any label generated in the assay from 

primers other than the probe would not be captured and 
therefore absent when analysis is performed. 

Form (3) is also superior to many conventional probe 
assays where the probe is labeled before the elongation 
25 step, since separating a labeled oligonucleotide probe 
from a labeled, but short primer-elongation product is 
more difficult than separating the same labeled, primer- 
elongation product from the unincorporated, labeled 
nucleotide substrates . 
30 Once the elongated probe is isolated from 

unincorporated, labeled nucleotide substrate, detection 
can proceed in a conventional fashion, either on the 
solid phase or otherwise. It should be apparent that 
the binding system used in forms (1) and (2) of the 
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present method should be independant of the binding 
system used to attach detectable label to the modified 
nucleotides during the detection step. 

Crucial to this invention, are the chain- 
5 terminating, detectably labeled nucleotide substrates. 
Detectably labeled does not mean that the detectable 
signal must be present at the time of incorporation. 
The fluorescent substrates described below require 
activation. Detectably labeled does not necessarily 
10 mean that the nucleotide substrates carry a reporter 
such that there is not only the ability to detect the 
label, but also to identify the nucleotide. If only one 
nucleotide is present in the reaction, then detection of 
incorporation is sufficient for identification. The 
15 modified dideoxy-nucleotide substrates described in 

Prober et . al (EP-A 252683) or the DyeDeoxy terminators 
(a trademark of Applied Biosystems, Inc., Foster City, 
California) are examples of chain-terminating detectably 
labeled nucleotide substrates. However, unlike 
20 sequencing using f luorescently labeled chain-terminating 
nucleotides, there is essentially no requirement in this 
method that each of the modified nucleotides have a 
similar mobility shift when run on a sequencing gel. In 
the preferred embodiment, four chain-terminating 
25 nucleotides that are distinguishably labeled are present 
in each reaction. The need for four different labels is 
eliminated if the number of reactions per sample are 
increased (Figure 6) . Unlike reports in the prior art, 
all four chain-terminating nucleotides may be present in 
30 the initial reaction, but only one must be detectably 
labeled. Unlike nucleic acid sequencing using chain 
terminators, the chain-elongating dNTP substrates are 
not a component of the reaction of the present 
invention. 
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The chain-terminating nucleotides described in the 
present invention are labeled with a fluorescent signal 
generator (reporter) . A suitable fluorescent reporter 
is one that can be detected in its unprotected form at 
5 or below the level of detection that can be quickly 
achieved with 32 P, i.e., about 10~ 14 moles. Specific 
desirable characteristics may include a large 
coefficient of extinction in the region of excitation, a 
high quantum yield, an optimal excitation or emission 
10 wavelength (preferably above 350 nm) f and 

photostability . For example, fluorescent dyes that are 
efficiently excited by an argon laser are desirable 
because of the low cost of this laser. 

Preferably in its unprotected form, the reporter is 
15 a fluroescent dye chosen from the group consisting of 
xanthenes (e.g., fluoresceins, eosins, erythrosins) , 
rhodamines (e.g., tetramethylrhodamine, Texas Red®), 
benzamidizoles , ethidiums , propidiums , anthracyclines , 
mithramycins, acridines, actinomycins, merocyanines, 
20 coumarins (e.g., 4-methyl-7-methoxycoumarin) , pyrenes, 
chrysenes, stilbenes, anthracenes, naphthalenes (e.g., 
dansyl, 5-dimethylamino-l-naphthalenesulf onyl) , 
salicyclic acids, benz-2-oxa-l-diazoles (also known as 
benzofurazans) (e.g., 4-amino-7-nitrobenz-2-oxa-l, 3- 
25 diazole) , and f luorescamine. Useful forms of many of 

these dyes are commercially available. For a review of 
fluorescent dyes used in tagging DNA, see 
A. S. Waggoner, Chapter 1, Applications of Fluorescence 
in the Biomedical Sciences, ed. by D. L. Taylor, et al., 
30 Alan R. Liss, New York (1986) . An extensive description 
of chain terminator labeling is found in U.S. 
Application No. 07/057,566 filed June 12, 1987, 
incorporated herein by reference. 
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The present invention is - further illustrated by 
reference to Figures 1-6. 

In Figure la, an analyte strand (An) contains a 
nucleotide position of interest (N) , the identity of 
5 which is to be determined by the assay, is defined as 
the first base of the analyte nucleic acid strand which 
is beyond the 5* end of the target nucleotide sequence 
in the 3 f to 5 f direction. A probe polynucleotide is 
produced as a reagent having a binding region 
10 complementary to the target nucleotide sequence (TNS) . 
In this particular embodiment, the probe polynucleotide 
consists only of that complementary sequence; in other 
embodiments, the probe is extended in the 5* direction 
in a manner that does not interfere with the recognition 
15 and complementary base pairing to the target nucleotide 
sequence. The diagram in Figure la illustrates the 
double stranded nucleic acid region which forms when the 
probe binds to analyte strand An by complementary base 
pairing to the target nucleotide sequence TNS. 
20 By contacting the double stranded region shown in 

Figure la with a DNA polymerase specific therefore, the 
3 f end of the probe will be utilized as a primer and 
elongated opposite the analyte strand An which serves as 
a template for nucleotide incorporation. As illustrated 
25 in Figure lb, the nucleotide incorporated <N*) will be 
complementary to the nucleotide position of interest 
(N) . In all illustrations, the * symbol is used to 
illustrate a detectable label attached to the 
nucleotide. The enzyme, primer and nucleic acid analyte 
30 are chosen together such that a nucleotide complementary 
to the target nucleotide of interest is incorporated. 
For example, if analyte strand An is DNA, then a reverse 
transcriptase, a primer dependent prokaryotic DNA 
polymerase (e.g. the Klenow fragment of £. coii DNA 
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Polymerase I or TAQ polymerase) or a eukaryotic DNA 
polymerase may be used/ with the probe being DNA or 
RNA. . 

Figures Ic-e are examples to illustrate the more 
5 schematic drawings of Figures la&b. In Figure 1c, the 
target nucleotide of interest is a T. After chain 
elongation of the primer with a DNA polymerase in the 
presence of detectably labeled chain terminators, the 
complementary nucleotide, A*, will be covalently 

10 attached to the primer (Figure Id) . If the nucleotide 
of interest was a C, then the complementary nucleotide 
that is incorporated will be a G* (Figure le) . If the 
nucleic acid sample being analyzed contains molecules of 
several types, then several different nucleotides may be 

15 incorporated and covalently attached to the primer (e.g. 
if both T and C are present in the position of interest 
in a portion of the molecules, then a result corres- 
ponding to Figures ld&e may occur within one sample 
being analyzed) . 

20 Figures lf-h are very similar to Figures lc-e except 

that the opposite nucleic acid strand is utilized as 
template thus the oligonucleotide probe is chosen to 
correspond to a different TNS. 

Figures 2 & 3 illustrate in schematic form the 

25 sequence of events that comprise preferred embodiments 
for carrying out the present invention. In Figure 2 the 
immobilized strand is used as the template (An) , and in 
Figure 3 the eluted or non-immobilized strand is used as 
template- As shown, the polymerase chain reaction (PCR) 

30 may be used to generate the template strand in increased 
quantity before analysis. The removal of the 
unincorporated nucleotides and primers is essential, and 
can be performed by binding the double stranded PCR 
product to a solid support, e.g. by a biotin (B) - 
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streptavidin complex, and rinsing away the unbound 
material. The two strands are then denatured (e.g. by 
addition of NaOH) and only the template strand is 
retained for the reaction. In Figure 2 the immobilized 
5 template strand is. rinsed, while in Figure 3 the 
soluble, eluted strand is used as template after 
neutralizing the NaOH solution. The probe 
oligonucleotide is then hybridized to the template 
strand and the hybridized probe is elongated by addition 
10 of a single, chain terminating nucleotide. The enzyme 
utilized in the reaction is a DNA polymerase such as 
reverse transcriptase and all four chain terminating 
nucleotides may be present, although only one must be 
detectably labeled. The unincorporated nucleotides are 
15 removed from the reaction by washing. Note that in 
Figure 3, the template strand was not previously 
immobilized, so the probe oligonucleotide can now be 
captured onto solid support for efficient washing. The 
nature of the label present on the elongated primer may 
20 be measured directly after efficient removal of the 

unincorporated substrate. That is, the primer may still 
be bound to the solid support, either directly as shown 
in Figure 3 or indirectly through the hybrid formed with 
the analyte strand (Figure 2 without the final 
25 denaturation step). For the particular brand of 

streptavidin coated magnetic beads used in our examples, 
the labeled primer is released from the beads after 
heating in the presence of formamide and EDTA. The 
magnetic beads do not interfere with standard gel 
30 electrophoresis although they are loaded into the sample 
well along with the sample. If the sample is assayed 
through a capillary, then the beads may obstruct flow 
and should be removed. 
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Figure 4 illustrates that the nucleotide of interest 
(N) can even be located at the end of a nucleic acid 
strand. This is different from that of multiple 
nucleotides during the reaction. 
5 Figure 5 illustrates that the assay is useful for 

detecting insertions and deletions as well as the point 
mutations illustrated in Figures lc-h. In Figure 5a , 
two template nucleic acid strands are drawn , with the 
upper strand differing from the lower strand by the 
10 presence of two T*s (note that one strand may be 

considered to have a deletion, or the other strand may 
be considered to have an insertion) . Figures 5b&c 
illustrate one possible choice of probe and the 
resulting difference in the nucleotide incorporated when 
15 the two different strands are used as the template. An. 

Figure € illustrates how the number of assays must 
increase if all four chain terminating nucleotides are 
not detectably labeled and distinguishable one from 
another . 

20 The number of reactions required to identify the 

nucleotide of interest in a given. sample is dependent 
upon how many of the different, possible substrates are 
detectably labeled and distinguishable. 

25 M^t-hori 1 fPrftferrfid Method) 

All possible nucleotide substrates are present in 
reaction and all four are detectably labeled and 
distinguishable from each other (e.g., ddATP*, ddTTP*, 
ddCTP*, ddGTP* are provided as substrate) 
30 One Reaction: 

Result: Only ddATP* is incorporated and 

detected. 
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Conclusion: Since the other nucleotides were not 
detected, only T is present at the 
nucleotide of interest. No other 
reactions are required. 

5 

Method 2 

All possible nucleotide substrates are present in 
each reaction but perhaps only 2 can be detectably 
labeled such that they are distinguishable from each 
10 other. 

1st Reaction: provide substrates ddATP*, ddGTP*, 
ddCTP, ddTTP. 
Result: Only ddATP* is incorporated and 

detected. 

15 Conclusion: T is present at the nucleotide position, 

and C is not present . But do not know 
if either G or A are present . 
2nd Reaction: provide substrates ddCTP*, ddTTP*, 

ddATP, ddGTP. 

20 Result: There was no detectable incorporation. 

Conclusion: G or A are not present at the position 
of interest . 

(To provide evidence to support this further, one could 
use the same substrate mixtures except monitor the 
25 incorporation using the complementary strand as the 

anlayte and the different, but appropriately positioned 
primer.) 

Method 3 

30 All possible nucleotide substrates are present in 

each reaction but perhaps only one label is available 
for substrate labeling (e.g., the same as when 
radioactively labeled ddNTP's are utilized). 
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1st Reaction: 



Result : 



Conclusion 



10 



15 



20 



2nd Reaction: 

Result : 
Conclusion: 

3rd Reaction: 

Result : 
Conclusion: 

4th Reaction: 

Result : 
Conclusion : 



provide substrates ddATP*, ddGTP, 
ddCTP, ddTTP. 

Only ddATP* is incorporated and 
detected. 

T is present at the nucleotide position, 
but after only this first reaction, it 
cannot be stated that another nucleotide 
is not also present, 
provide substrates ddATP*, ddGTP*, 
ddCTP , ddTTP . 

No detectable incorporation. 

C is not present at the nucleotide 

position of interest . 

provide substrates ddATP, ddGTP, 

ddCTP, ddTTP. 

No detectable incorporation. 

G is not at the nucleotide 

position of interest. 

provide substrates ddATP, ddGTP, 

ddCTP, ddTTP*. 

No detectable incorporation. 

A is not at the nucleotide 

position of interest. 



25 EXAMPLES 

The following examples are offered by way of 
illustration and are not intended to limit the invention 
in any manner. 

EXAMPLE 1 

30 Aims : 

1. To demonstrate that the claimed method can be 
used to identify the nucleotide present at a defined 
position on a nucleic acid strand in a single reaction. 
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2. To illustrate that a single f luorescently 
labeled nucleotide can be incorporated in the assay 
using a commercially available enzyme preparation. 

3. To illustrate that the unincorporated labeled 
5 substrate can be efficiently removed without time- 
consuming centrifugation or column chromatography. 

4 . To illustrate the use of a DNA strand labeled at 
the 5* end with Biotin and bound to a solid support as 
the analyte strand. 

10 5 . To illustrate the ability to distinguish between 

three DNA samples by incorporation of an A (wildtype 
allele), a G (mutant allele), or both A & G 
(heterozygote) as the complementary nucleotide opposite 
the nucleotide position of interest, 

15 6. To illustrate that the fluorescent nucleotide 

substrates can be detected and distinguished on the 
Genesis 2000 DNA analysis unit either by gel 
electrophoresis or by passing the sample through a 
capillary. 

20 

Definition of the Nucleotide Position of Interest: 

In this example, the nucleotide position of interest 
is that of the lower strand at nucleotide position 5430 
of the mouse RNA polymerase II largest subunit gene as 

25 described by publication in the GenBank database, 

accession M12130 for the locus RO:Musrpolii2 . A 602 
nucleotide portion of this sequence from nucleotide 4 915 
to 5517 is illustrated in its double stranded form in 
Figure 8, with the nucleotide position of interest for 

30 this example being at position 5430 on the lower strand 
(occupied by a bold-faced T in the sequence of the 
Wildtype allele which is shown in this Figure 8) . 
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npfinition of i-hft Target Nucleotide Sequence: 

in this example/ the target nucleotide sequence 
(TNS) is chosen as the 21 nucleotide sequence 
(3 • TAACGACAACAGCCCGTCGTC5 1 ) that immediately flanks the 
5 nucleotide of interest such that the nucleotide position 
of interest is the next contiguous nucleotide in the 3 ' 
to 5 1 direction on that nucleic acid strand (see Figure 
lc) . 

10 Thg oligonucleotide probe ; 

In this example, the oligonucleotide probe consisted 
of the 21 nucleotide sequence 5' ATTGCTGTTGTCGGGCAGCAG 
3' (probe A of Figures 8 and 9), and is perfectly 
complementary to the target nucleotide sequence defined 

15 above. It is synthesized on an Applied Biosystems DNA 

synthesizer and further purified by HPLC to consist of a 
single oligonucleotide species, 21 nucleotides in 
length. (methods as described in Oligonucleotide 
Synthesis, A Practical Approach ed. M.J. Gait, IRL Press 

20 1984) . 

Starting biological sample: 

In this example, the claimed method will be 
illustrated using three different amplified DNAs in 

25 three separate, but similar reactions. The three 

starting biological materials used for the amplification 
process are each known to contain the mouse RNA 
polymerase II gene. The three samples are designated 
Wildtype, Mutant, and Heterozygote . They are known to 

30 differ at nucleotide position 5430 (as shown in bold 
faced type in Figure 9), with the Wildtype allele 
containing an A-T base pair, the Mutant allele 
containing a G-C base pair, and the Heterozygous sample 
containing an equal mixture of these two alleles. The 
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starting biological materials are obtained from 
J. Corden and are as described in Bartolomei and Corden, 
Molec. and Cell* Biol. 7:586-594, 1987. The Wildtype 
and Mutant alleles are provided as bacterial strains 
5 containing the recombinant plasmids pE2 6-4 and pE26-7 
respectively. The biological sample designated as 
Heterozygous is obtained as a cell line A21. DNA of 
each of the recombinant plasmids is prepared by standard 
molecular biology procedures (described in Sambrook et 
10 al.i- Molecular Cloning: A Laboratory Manual 1989) , and 
genomic DNA is prepared from the A21 cell line as 
described in Corsaro and Pearson, Somatic Cell Genet. 
.7:603-616, 1981 . 

15 Amplifying a segment of DNA containing the nucleotide 

position of interest and the target nucleotide sequence; 

As shown in Figure 8, the target nucleotide sequence 
and the nucleotide position of interest are within a 602 
base pair segment of the RNA polymerase II gene. The 

20 copy number of this segment is increased using 

exponential amplification, using DNA of each of the 
three biological starting materials described above. 
The oligonucleotide primers used for PCR amplification 
of the region of interest in the RNA polymerase II gene 

25 are designated PCR amplification Primer 1 

(5 • CAGACATTTGAGAATCAAGTGAATCG 3 ' ) and PCR amplif ication 
Primer 2 (5' BCTCGGCTCTCAGGACCATAATCAT 3') where 
B=biotin (see Figure 8). They are synthesized by 
standard phosphoramidate chemistry on an Applied 

30 Biosystems DNA synthesizer. For the biotinylated 

primer, the biptin moiety is added at the 5 f end during 
synthesis as described in Cocuzza US patent 4,908,453. 
All such oligonucleotides used in this patent are 
prepared for the inventors by the Du Pont 
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Oligonucleotide Synthesis Facility under the direction 
of C. Burns, although commercial firms are available for 
attaining such items. 

For PCR amplification, thirty-three picomoles of 
5 each primer (one primer biotinylated at the 5 1 end) is 
mixed with approximately 1 ug of genomic DNA (or 0.1 ug 
of plasmid DNA) in a 50 \ll reaction mixture containing 
60 irM KC1, 15 mM Tris (pH8.8), 2.75 roM MgC12, and dATP, 
dGTP, dCTP, and TTP each at 200 uM. The mixture is 
10 incubated at 95°C for 2 min to separate the DNA strands 
and cooled on ice; 2.5 units of TAQ polymerase 
(AmpliTaq, Perkin Elmer Cetus) is added, and the 
reaction mixture is overlaid with approximately 35 Jll of 

mineral oil. The amplification conditions varied 
15 slightly in the course of the experiments, but are 

usually performed in a Perkin-Elmer/Cetus thermal cycler 
using an initial cycle consisting of 4 min 94°C, 45 sec 
55°C, 5 min 68°C, followed by 35 cycles with the same 
parameters except the denaturation at 94°C is 1 min. 
20 Following PCR amplification, 10-15 |Xl aliquots of the 

amplified fragment are run on a 1.5% agarose gel and 
visualized by ethidium bromide staining using standard 
procedures to ensure that an amplified fragment of the 
expected size is produced. Utilizing these primers and 
25 any of the three DNAs described above, a 602 bp PCR 
amplification product of double stranded DNA is 
consistently obtained. Aliquots of the remainder of the 
PCR amplification sample are then utilized in the method 
of this invention. 

30 

preparation of the analvte strand: 

The removal of unincorporated nucleotides and any 
interfering primers before performing the method of this 
invention is essential. In this example, the double- 
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stranded PGR amplification product contains a biotin 
moiety due to the biotin originally presnet on PCR 
amplification Primer 2 . Thus, the separation is done by 
binding the biotinylated PCR amplification product to a 
5 streptavidin-coated solid support and rinsing away the 
non-biotinylated, PCR amplification Primer 1 and the 
unincorporated nucleotides. The support -bound PCR 
amplification product is then denatured using NaOH, the 
complementary, non-analyte strand is removed and the 
10 remaining analyte strand which is still bound to the 
solid support is rinsed and ready for the primer 
extension reaction. These steps are illustrated in the 
schematic drawing of Figure 2 and described below as 
steps 1-6. 

15 1. Magnetic, streptavidin-coated beads from the 

Dynal corporation (Dynabeads M-280 Streptavidin, at 6 x 
108 beads/ml) are washed and resuspended at the same 
concentration in Triton Wash Solution [0.17% (w/v) 
Triton X-100, 100 mM NaCI, 10 mM Tris-HCl pH7.5, 1 mM 

20 EDTA] essentially as described in the Application Brief 
25 for the Genesis 2000 DNA analysis system. 

2. Approximately 20 |ll of double-stranded DNA 

template, amplified using PCR amplification with one of 
the two primers labeled with biotin, are mixed with 20 
25 \ll of washed Dynabeads and incubated at 37°C for 30 

minutes. This mixture is gently shaken intermitently in 
order to keep the magnetic beads in solution, 

3. After this incubation, the tube containing 
magnetic beads and DNA is placed near a magnet to draw 

30 the beads to one side of the tube. After approximately 
four minutes of magnetization, the supernatant is 
removed. 

4. The beads (with DNA bound) are then washed three 
times with TE buffer (10 mM Tris pH8, 1 mM EDTA) using 
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magnetization for removal of the supernatant which 
contains dNTP's and non-biotinylated PCR amplification 
primer. Care is taken that the beads did not dry 
between washes - 

5 5. After the final wash, 16 111 of sterile distilled 

water is added to the bead-bound DNA. 

6. The double-stranded DNA is denatured by addition 
of 4 ill of 0.5M NaOH, 2 mM EDTA solution, and incubated 
at room temperature for 5 min. Afterwards, the sample 
10 is magnetized and the supernatant removed. (The 

supernatant may be kept if the non-bead bound strand is 
to be used as template - e.g. Example 5) . The bead- 
bound DNA pellet is gently resuspended in 100 £11 TE 
buffer to neutralize any NaOH remaining. 

15 

Formation of the an a lyte-probe hybrid and enzvmatic 
extension of the p robe with a chain terminating 
nucleotide complementary to the nucleotide position of 

interest ; 

20 7. To use the bead-bound DNA as template, the TE 

buffer is removed following magnetization, and 7 \il of 

the following solution is added: 
2 \ll sterile water 
1 jil 125 |iM ddTTP (unlabeled) 
25 1 |ll 125 |0M ddCTP (unlabeled) 

1 [ll 6.6 [1M oligonucleotide probe 

2 *il 5X RT buffer [supplied by Invitrogen 

for use with Reverse Transcriptase] 

30 8. Incubate at 50°C for 5 min, then transfer to 

37°C for an additional 7 minutes, and then transfer to 
ice . 

9. The fluorescently-labeled chain terminators and 
the primer-dependent enzyme are then added to the 
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reaction. In this particular Example 2, it is as 
follows : 

1 111 of 125 |1M SF-ddGTP-505 
1 |ll of 125 (1M SF-ddLATP-512 

0.5 \ll Invitrogen Reverse Transcriptase (10|l/|ll) 

10. The labeling reaction is at 42°C for 10 minutes, 
and then the reaction is placed on ice and 100 |ll of TE 

is added. 

11. The sample is again magnetized for 4 minutes and 
the supernatant removed, followed by 3 washes of 100 ill 
TE buffer (magnetization between each wash) to remove 
unincorporated nucleotides. 

12 . The final supernatant is removed and the 
magnetic beads (with DNA bound) are resuspended in 6 \ll 

FE solution (95% formamide, 25mM EDTA) and stored -4°C 
until further use. 

Detection pf the chain terminating nucleotide attached 

to the probe: 

13. The sample from step 12 is diluted 1:16 fold 
further in FE containing crystal violet, for easier 
visualization in loading the sample and to get the 
sample in a reasonable concentration for detection by 
slab gel electrophoresis on the Genesis 2000 DNA 
analysis system (methods as described by the instrument 
documentation, with a few parameters described in more 
detail below) . 

14 . "Lane Finding" for the Genesis detection system 
is performed manually using a primer f luorescently 
labeled at the 3' end prepared in advance using terminal 
transferase and a fluorescent ddNTP as substrate as 
described in Trainor and Jensen, Nucl. Acids Res. 
16:11846, (1988) that is electrophoresed into each 
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sample lane approximately thirty minutes prior to 
briefly pausing the machine and then loading the 
reaction samples into the lanes. Finding lanes in this 
manner allowed the recording of the fluorescence 
5 detection in each lane to begin early enough such that 
any unincorporated labeled substrate remaining in the 
reaction would be recorded. 

15, Just prior to loading 1 |ll of the reaction 

sample onto the gel for electrophoresis and subsequent 

10 detection, it is combined with approximately 1 |ll of a 
shorter, f luorescently-labeled control primer, and 
heated 95°C for 2 minutes. (This control primer is 
added as a mobility standard, but is later found to be 
an unnecessary component and is omitted in later 

15 electrophoresis runs) . 

The first three lanes shown in Figure 10a are 
results from running such samples from each of the three 
reactions. The position of the peak corresponding to 
the elongated probe is designated WT, Mutant , and Het 

20 for the three reactions of this example. In these 
examples, the peak at position S corresponds to the 
control primer that is added at the time of 
electrophoresis. As can be noted, there is very little 
unincorporated fluorescent nucleotide (peak at position 

25 U) remaining in each of the three reactions due to the 
washes of step 11. 

Determining the identity of the chain terminating 
nucleotide t hat is added to the probe: 
30 The fluorescently labeled chain terminators used in 

the examples of this patent are either purchased from 
duPont NEN Biotechnology Systems (Boston , MA) , or 
obtained as a kind gift from Dr. Douglas Amorese of that 
firm. The nature and detection of these SF-ddNTPs are 
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described in Prober et al. (1988) Science 238, 336-341. 
In brief , the chain terminators are distinguished by a 
ratio of the measured fluorescence from two 
photomultiplier tubes (PMT) . Each PMT value is 
5 displayed as either a red or green mark on the output 
computer monitor r with a sample forming a peak as it 
passes by the excitory laser. (In the black and white 
Figures required for this patent application/ the 
original color of each line of the sample peak is 
10 indicated) . Unlike the normal method of fluorescent 
base detection when multiple peaks of a sequencing 
reaction are being analyzed, the commercial Genesis 2000 
software is unable to determine the identity of the 
fluorescent nucleotides (base call) in this application, 
15 for* only a few peaks are present in the lane. It is 

therefore necessary to prepare a one time calibration on 
the instrument by preparing a set of expected values for 
the fluorescently labeled chain terminators at various 
dilutions in FE (95% formamide, 25mM EDTA) . It is 
20 important to note that the commercial instrument is 

designed to have a non-linear response of the two PMTs 
when the voltage is too high. We experimentally 
determined that the ratio obtained for the green peak 
height to the red peak height for a given fluroescent 
25 substrate is relatively invariant from experiment to 
experiment over the range of 0.1-9 volts. Thus the 
green to red ratio of a peak is only determined if the 
reaction samples are within this voltage range. 

The result of such a calibration (+/- one standard 
30 deviation) is shown in Figure 7a by two different assay 
methods. The samples are either electrophoresed on a 
urea-polyacrylamide gel by standard gel electrophoresis 
procedures for the Genesis 2000, or syringe-loaded into 
a single, empty capillary (Part # TSP530700 from 
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Polymicro Technologies, Phoenix, A2) positioned in front 
of the excitatory laser beam on a Genesis 2000 unit. A 
photo and full description of this modified Genesis 
apparatus is given in Zagursky and McCormick, 
5 Biotechniques 9:74-79 (1990). (Care should be taken 

during capillary alignment to avoid electrical shock or 
direct eye contact with the laser beam, ) An example of 
the type of data collected using standard gel 
electrophoresis methods on the Genesis is shown in 
10 Figure 7b for SF-ddGTP-505 and SF-ddTTP-526 . Figure 7c 
illustrates the type of measurement made when the 
fluorescent substrate SF-ddGTP-505 or SF-ddCTP-519 is 
loaded via syringe into the capillary mounted onto the 
Genesis 2000 detection system. The multiple peaks 
15 represent the same sample being pushed several times in 
front of the laser beam. Although the green/red ratio 
for a particular fluorescent substrate is different from 
that of Figure 7b, the nucleotides can be distinguished 
in this new detection system at concentrations similar 
20 to that of gel electrophoresis as illustrated in Figure 
7a. (However, since the rate at which the sample is 
manually pushed in front of the detection system is not 
uniform, the overall peak height is not reproducible in 
this experiment) . 
25 Regardless of whether the sample is electrophoresed 

through a gel matrix or pushed through a capillary, the 
SF-ddNTPs are distinguishable (Figure 7a) . 

16. To determine the identity of the chain 
terminating nucleotide attached to the probe, the PMT 
30 ratio (green/red peak height) is measured for each 
sample using the position of the initial rise of the 
peak as the baseline. 

For accurate determination of this ratio, the three 
reaction samples of this example (i.e. the WT, Mutant, 



WO 92/16657 



PCT/US92/01691 



41 

and Het peak) are rerun at lower dilution (since the 
voltage of two of them are originally too high as shown 
in Figure 10a) . The resulting sample peaks are 
displayed in Figure 10b with a smaller display window 
5 for easier measurement. In this example, the measured 
green/red ratios are as follows: 

WT = 1 .55 Mutant =2.5 HET « 1.9 

10 A comparison of these values to the calibration 

shown in Figure 7a illustrates that these ratios 
correspond to the expected incorporation of SF-ddATP-512 
when the Wildtype allele is the source of the analyte 
strand, incorporation of SF-ddGTP-505 when the Mutant 

15 allele is the source of the analyte strand, and a 

mixture of both fluorescent nucleotides in approximately 
equal proportions when the analyte strand is derived 
from a heterozygous source (i.e. approximately equal 
number of Wildtype and Mutant analyte strands) . 

20 

Identifying the nucleotide of interest as fr.-h* nucleotide 
complementary to the chain terminating n ucleotide which 
is added: 

17. The nucleotide at the position of interest is 
25 the nucleotide complementary to the nucleotide that is 
incorporated. 

Therefore, the conclusion for the three samples of 
this example are as expected: 

The reaction performed on the Wildtype allele 
30 indicates that it does contain a thymidine (T) on the 
lower strand at nucleotide position 5430 of the mouse 
RNA polymerase II largest subunit gene, for the 
nucleotide incorporated is SF-ddATP-512 , The reaction 
performed on the Mutant allele indicates that it does 
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contain a cytosine (C) on the lower strand at nucleotide 
position 5430 of the mouse RNA polymerase II largest 
subunit gene f for the nucleotide incorporated is 
SF-ddGTP-505. The reaction performed on DNA originating 
5 from the A21 cell line (which is known to be 

heterozygous for the wildtype and mutant allele) 
contains an equal number of thymidine (T) and cytosine 
(C) residues at the position of interest/ for an 
approximately equal number of SF-ddATP-512 and SF-ddGTP- 
10 505 are incorporated onto the probe. 

EXAMPLE 2 

Aims : 

1. To illustrate the ability to incorporate and 
15 distinguish a f luorescently labeled SF-ddCTP-519 and 

SF-ddTTP-526 in the practice of this invention. 

2. To illustrate the ability to use the 
complementary strand to that used in Example 1 as the 
bead-bound analyte strand and to illustrate the use of 

20 another oligonucleotide sequence as the probe (probe B 
of Figure 8) . 

Definition of the Nun leotide Position of Interest: 

In this example, the nucleotide position of interest 

25 is that of the upper strand at nucleotide position 5430 
of the mouse RNA polymerase II largest subunit gene as 
described by* publication in the GenBank database, 
accession M12130 for the locus R0:Musrpolii2 . A 602 
nucleotide portion of this sequence from nucleotide 4915 

30 to 5517 is illustrated in its double stranded form in 
Figure 8, with the nucleotide position of interest for 
this example being at position 5430 on the upper strand 
(occupied by a bold-faceted A in the sequence of the 
Wildtype allele which is shown in this Figure 8) . 
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Definition of the Target Nucleotide Sequence: 

In this example, the target nucleotide sequence 
(TNS) is chosen as the 21 nucleotide sequence 
. 5 (5 f ATGTAGAGGGCAAGCGGATCC3 1 ) that immediately flanks the 
nucleotide of interest such that the nucleotide position 
of interest is the next contiguous nucleotide in the 3' 
to 5* direction on that nucleic acid strand (see also 
Figure If) • 

10 

The oligonucleotide probe: 

In this example, the oligonucleotide probe consisted 
of the 21 nucleotide sequence 5' GGATCCGCTTGCCCTCTACAT 
3 f (probe B of Figures 8 and 9), and is perfectly 
15 complementary to the target nucleotide sequence defined 
above. Synthesis and purification is as described in 
Example 1 . 

Starting biological sample: 
20 In this example, the claimed method will be 

illustrated using two of the same starting biological 
samples as described in Example 1: that of the Wildtype 
and Mutant. They are prepared as described in Example 
1. 

25 

Amplifying a segment of PNA containing the nucleotide 

position of interest and the target nucleotide sequence: 

The region of interest is amplified from the 
Wildtype and Mutant samples using methods as described 
30 in Example 1 with PCR amplification Primer 1 and PCR 
amplification Primer 2, except in this Example 2, the . 
PCR amplification Primer 1 is biotinylated at the 5' end 
and the PCR amplif icatiori Primer 2 is not . 



WO 92/16657 



PCT/US92/01691 



44 

Preparatio n of the analyse -strand: 

The methods are as described in Example 1 steps 1-6, 
however in this Example, it is the complementary strand 
which is bound to the solid support, for this strand 
5 contains the biotin from the PCR amplification reaction. 

The invention is practiced as in the steps of 
Example 1 on the Wildtype and Mutant analyte strands 
with the following exceptions: 

a) The two unlabeled nucleotide substrates in step 
7 are ddGTP and ddATP . 

b) The two f luorescently labeled nucleotide 
substrates in step 9 are 1 *il of 30 uM SF-ddCTP-519 and 

I |11 of 125 |!M SF-ddTTP-52 6. 
The results shown in Figure 11 illustrate that the 

Wildtype and Mutant allele have green/red ratios of 0.35 
and 0-7 respectively when probe B is used. The 
calibration graph of Figure 7a shows that this 
corresponds to the incorporation of SF-ddTTP-526 and 
SF-ddCTP-519 for the Wildtype and Mutant allele. 

THE CONCLUSION for the two samples of this example 
are as expected (see Figure 9) : 

The reaction performed on the Wildtype allele indicates 
25 that it does contain a adenine (A) on the upper strand 
at nucleotide position 5430 of the mouse RNA polymerase 

II largest subunit gene, for the nucleotide incorporated 
is SF-ddTTP-526. The reaction performed on the Mutant 
allele indicates that it does contain a guanine (G) on 

30 the upper strand at nucleotide position 5430 of the 

mouse RNA polymerase II largest subunit gene, for the 
nucleotide incorporated is SF-ddCTP-519. 
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EXAMPLE 3 

Aim: 

1. To illustrate that under the conditions of 
Examples 1 and 2, that in some cases , the wrong 
5 nucleotide will be incorporated if the correct 

nucleotide is missing from the reaction (i.e. the 
problem with many of the assays discussed in the prior 
art is that of significant misincorporation in reactions 
where the correct nucleotide is not provided. 
10 For this example, the same two reactions as 

described in Example 2 are performed with the following 
exceptions : 

a) For the reaction with the Wildtype allele in 
step 9 the SF-ddTTP-526 is omitted and the only 

15 fluorescent substrate in the reaction is 1 |ll of 125 |1M 

SF-ddCTP-519. (Unlabeled ddTTP is also absent in the 
reaction) . 

b) For the reaction with the Mutant allele in step 
9 the SF-ddCTP-519 is omitted and the only fluorescent 

20 substrate in the reaction is 1 |ll of 125 JiM 

SF-ddTTP-526. (Unlabeled ddCTP is also absent in the 
reaction) . 

The results shown in Figure 12 illustrate that for 
the Wildtype allele (upper panel) , there is no 

25 significant misincorporation of SF-ddCTP-519 as a 
complementary base for the adenine (A) present as 
template for the primer extension. This is true 
although the SF-ddCTP-519 is present at a higher 
concentration than in Example 2. No conclusions can be 

30 made with respect to whether A or G is incorporated, for 
these nucleotides although present, are not 
fluorescently labeled in the reaction. The lower panel 
of Figure 12 illustrates a significant level of 
misincorporation of SF-ddTTP-526 as a complementary base 
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for the guanine (G) present on the Mutant analyte strand 
for the primer extension of probe B (refer to Figure 9) . 

These results suggest that in some cases (e.g. as in 
the upper panel of Figure 12) there will be very little 
5 misincorporation when the correct, complementary base 
is not provided in the reaction. Therefore, in such 
cases, the method of the present invention can be 
performed without all four bases present. However, the 
lower panel of Figure 12 clearly suggests that in other 
10 instances, the perf erred form of this invention would be 
to perform the reaction with all four ddNTPs present. 

KXAMPLE 4 

Aims : 

15 i. To illustrate that the use of all four 

detectably labeled nucleotides in a single reaction 
gives essentially the same PMT green/red ratio as 
compared to Example 2 when the correct base is present. 
2. To illustrate that a lowering of the 

20 concentration of ddNTP substrate by 5 fold may improve 

the ability to rinse away unincorporated ddNTP 1 s without 
affecting the ability to measure the sample. 

Sample P reparation 
25 The reaction is the same as that described for 

Example 2 except the unlabeled nucleotides are omitted 
from step 7 and all four ddNTPs are present in step 9 at 
1/5 the concentration. The results shown for Mutant and 
Wildtype amplified alleles are shown in the two panels 
30 of Figure 12. The Mutant template resulted in a 
green/red ratio =0.77 suggesting the correct 
incorporation of SF-ddCTP-519, while the Wildtype 
template gave green/red =0.42 suggesting correct 
incorporation of a SF-ddTTP-526 (again see 
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lanes, as in other lanes of the experiment, there is 
essentially no peak of unincorporated nucleotides 
present in the sample. 

5 

EXAMPLE 5 

Aims : 

1. To illustrate the use of a nucleic acid strand 
that is not bound to a solid support as the analyte 

10 strand. 

2. To practice the method of this invention on a 
totally different biological sample than that used in 
Examples 1-4. 

15 Defi nition of the Nucleotide Position of Interest - 

In this example, the nucleotide position of interest 
is that of the lower strand, occupied by a circled G on 
Figure 14. The nucleotide sequence shown is a portion 
of the Wildtype Al gene of maize (Schwarz-Sommer et al. r 

20 EMBO J. 6:287-294 (1987). 

Definition Of the Target. Nucleotide Sequence! 

In this example, the target nucleotide sequence. 
(TNS) is chosen as the 21 nucleotide sequence 
25 (3 * GACGAACTCCTAGCTCATCAC5 f ) that immediately flanks the 
nucleotide of interest such that the nucleotide position 
of interest is the next contiguous nucleotide in the 3' 
to 5" direction on that nucleic acid strand. 

30 The oligonucleotide probe; 

In this example, the oligonucleotide probe consists 
of the 21 nucleotide sequence 5 1 CTGCTTGAGGATCGAGTAGTG 
3 f (Primer C of Figure 14), and is perfectly 
complementary to the target nucleotide sequence defined 
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above . Primer C is biotinylated at the 5 ■ end during 
primer synthesis and is HPI*C purified by methods 
described in Example 1 . 

5 Biological sample: 

DNA sequence analysis of several alleles of the 
maize gene had shown that among other differences , 
the a-dt mutant allele contains a G-C base pair while 
the Wildtype allele had a C-G base pair at the 
10 nucleotide positions circled in Figure 14. For this 

example, total genomic plant DNA is prepared (method as 
described in Shepherd et-al., Mol. Gen. Genet. 188:266- 
271 (1982) from a maize plant known to be homozygous for 
the a-dt mutant allele of the KL gene. 

15 

Amplifying a segme nt of DNA containing the nucleotide 
position of intere st and the target nucleotide sequence: 

PGR amplification primers A & B homologous to a 
section of the maize Al gene are designed, synthesized, 

20 and used to amplify the genomic fragment from total 

maize DNA containing the mutant a-dt allele of maize by 
methods described in Example 1. In this example, the 
Primer' B is the PCR amplification primer containing 
biot in at the 5 % end . 

25 Preparation of the analyte strand is performed as in 

steps 1-6 of Example 1, except that the non-bead bound 
strand from step 6 will be used as the analyte strand 
below. 

30 The sequence of events for using the non-bead bound 

strand as the analyte strand is illustrated in Figure 3 
and the steps are given below. 

7» After denaturation of the two PCR amplification 
strands, the strand that is present in 16 \ll of basic 
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NaOH solution is carefully neutralized by addition of a 
few microliters of 0.5M HC1, monitoring the pH of the 
solution using pH paper. To this neutralized DNA sample 
(vol. aprox. 23 Jll) , the following addition is made: 
5 8.0 [11 of 5X RT buffer (Invitrogen) 

2.8 Jll of 1% Triton X-10 

0.5 |ll of 1 iriM ddATP (unlabeled) 
0.5 [ll of 1 mM ddTTP (unlabeled) 
4.0 p.1 of 6.6 |1M Biotinylated nested primer 
10 8. The sample is incubated 95°C for 2 minutes, 37°C 

10 minutes, and then placed on ice. 

9. The following additions are made: 
2 |ll of 125 [1M SF-ddCTP-519 
2 |X1 of 125 |1M SF-ddGTP-505 
15 1 |ll of Reverse Transcriptase (Invitrogen 10 

H/»il> 

10. The sample is incubated 42°C 10 minutes. 

11. 15 Jll of Dynabeads (prepared as in step 1) are 

added and followed by a 37°C incubation for 15 minutes 
20 with intermitent shaking. This is to promote binding of 
the nested, biotinylated primer (containing fluorescent 
label from the primer extension reaction) . 

13. The sairyple is magnetized for 4 minutes and 
unincorporated nucleotides present in the supernatant 

25 are removed. 

14 . The final bead pellet is washed 3 times with 100 
|il TE (magnetizing each time to remove the buffer) . 

15. The final bead pellet is resuspended in 5 fil of 

FE (95% formamide, 25. mM EDTA) . 
30 16. 2 |11 of this sample along with 1 til of a 

smaller, control primer (Std) are heated for 2 min 95°C, 
before loading on a urea-polyacrylamide gel and 
electrophoresis on the Genesis 2000. 
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The panel of Figure 14 illustrates the peak due to 
incorpor ation of a base complementary to the nucleotide 
position of interest for the Mutant allele (a-dt) . The 
green/red ratio is 2.6, consistent with the correct base 
5 G being added to the biotinylated, nested primer. 
Although this procedure clearly worked, correct 
neutralization of the template strand is a time- 
consuming process that is somewhat variable with respect 
to final pH and resulting salt concentration. The 
10 preferred method is therefore to work as in examples 4-6 
with the biotinylated strand as template. 

F.XAMPLE 6 

Aim: 

15 1. To illustrate the use of another primer 

dependent DNA polymerase in practicing this invention. 

2. To demonstrate that the method of this invention 
is capable of distinguishing a heterozygous DNA sample 
with one reaction (in this case, an equal number of 
20 cytosine and guanine nucleotides at the nucleotide 
position of interest) . 

This example is the same as Example 5 with the 
following exceptions: 

a) Genomic DNA is prepared from maize leaf material 
25 as described in Example 5, but the exact nature of the 

nucleotide of interest in the sample is unknown until 
after the method of this invention is performed. 
(Standard DNA sequence analysis of this region of the 
DNA later confirmed that the biological material is 
30 heterozygous at the nucleotide position of interest with 
both cytosine (C) and guanine (G) being present in 
essentially equal amounts (data not shown) . 

b) In step 7, the following are the additions made 
to the neutralized, non-bead bound strand: 
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8.0 \il of 5X Sequenase buffer (200 mM Tris pH 7, 

100 mM MgC12, and 250 mM NaCl) 
2.8 |ll of 1% Triton X-100 
0.5 [ll of 1 mM ddATP (unlabeled) 
5 0.5 |ll of 1 mM ddTTP (unlabeled) 

4.0 \Ll of 6.6 JIM Biotinylated oligonucleotide 

probe (primer C) 

c) In step 9, 1.5 |il of 100 mM Dithiothreitol is 

added (in addition to the fluorescent substrates SF- 
10 ddCTP-519 and SF-ddGTP-505) , and 1 |Il of 13 units/|ll 

Sequenase Version II enzyme (a modified T7 DNA 
polymerase: US Biochemical Corp. US Patent 4,795/699) 
instead of the reverse transcriptase enzyme. 

d) In step 10, the labeling reaction is at 37°C for 
15 10 minutes. 

The results of Example 6 are shown in Figure 16. It 
is seen that a single peak of incorporation appears, 
suggesting that the Sequenase II enzyme can also be used 
for the practice of this invention with no significant 

20 3' -5' exonuclease activity. The green/red ration (=1.7) 
of this peak is as would be expected for a DNA sample 
that is heterozygous. That is, the calibration graph of 
Figure 7c indicates that the value of 1.7 is 
approximately equal distance between the expected values 

25 for incorporation of SF-ddCTP-519 (green/red 

approximately 0.9) and that for incorporation of 
SF-ddGTP-505 (green/red approximately 2.4). Note that 
in this example, SF-ddATP-512 that gives a green/red 
ratio of approximately 1.6 is not included in the 

30 reaction, thus the 1.7 ratio does not indicate the 
addition of an adenine. 

In conclusion, in Example 6 it is determined that 
the nucleotide position of interest in the sample is 
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occupied by an approximately equal number of cytosine 
(C) and guanine (G) residues. 

EXAMPLE 7 

5 This example illustrates identification of a 

nucleotide of interest at a defined location in samples 
of a 700 bp DNA element, Cinl, from the analysis is 
carried out in a single reaction Northern Flint Line of 
Zea mays in a single reaction. The Cinl element is 
10 repetitive in Zea mavs . 

The oligonucletodie probe is a 21 nucleotide 
sequence perfectly complementary to the target 
nucleotide sequence. The target nucleotide sequence is 
a 21 nucleotide sequence that immediately flanks the 
15 nucleotide of interest at position 500. Synthesis of 
these sequences is as shown in Example 1 . 

The region of interest containing the nucleotide of 
interest on each Cinl template is amplified as in 
Example 1 using amplification primers land 2. Primer 1 
20 is biotinylated at the 5* end. 

Formation of the probe-template hybrid is carried 
out as in Example 1. However, the reaction buffer 

contains : 

1.0 Hi 25 [1M SF-ddTTP-526 
25 1 . 0 [±1 25 (XM SF-ddCTP-519 

1.0 ^1 25 (1M SF-ddGTP-505 

1.0 M-l 25 (IM SF-ddATP-512 

2.0 |il sterile water 

2.0 |il 5X RT buffer 
30 0.5 \Ll Reverse Transcriptase (10 Jl/M-1- 

The labeling reaction takes place at 42°C for 10 
minutes, is placed on ice and 100 \ll of TE buffer is 

added . 
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The chain-terminating nucleotide which extended the 
probe at the position complementary to the nucleotide of 
interest is detected and determined as shown in Example 
1. The nucleotide of interest at position 500 is 
5 identified as the nucleotide complementary to the chain- 
terminating nucleotide which extended the probe in the 
labeling reaction. The presence and nature of a 
polymorphism can be determined by comparing the samples 
tested. 

10 

From the foregoing description, one skilled in the 
art can easily ascertain characteristics of this 
invention, and without departing from the spirit and 
scope thereof, can make various modifications of the 
15 invention to adapt it to various uses and conditions. 



20 



25 



30 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: LIVAK, KENNETH J. 

RAFALSKI, J. A. 
SHEPHERD, NANCY S. 
(ii) TITLE OF INVENTION: METHOD OF IDENTIFYING A 

NUCLEOTIDE PRESENT AT A 
DEFINED POSITION IN A 
NUCLEIC ACID 

(iii) NUMBER OF SEQUENCES: 34 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DU PONT COMPANY 

(B) STREET: BARLEY MILL PLAZA 36/2152 

(C) CITY: WILMINGTON 

( D ) STATE : DELAWARE 

(E) COUNTRY: USA 

(F) ZIP: 19880-0036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.0 MB 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: Macintosh System, 6.0 

(D) SOFTWARE: Microsoft Word, 4.0 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/669,568 

(B) FILING DATE: March 13, 1991 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: GALLEGOS, R. THOMAS 

(C) REFERENCE /DOCKET NUMBER: CR-8918 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 302-892-7342 
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(B) TELEFAX: 302-892-7949 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 

CTGCTGCCCG ACAACAGCAA T 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 

ATTGCTGTTG TCGGGCAGCA G 

(2 ) INFORMATION FOR SEQ ID NO : 3 : 

( i) SEQUENCE . CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAGACATTTG AGAATCAAGT GAATCG 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

CTCGGCTCTC AGGACCATAA TCAT 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATGTAGAGGG CAAGCGGATC C 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid. 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GGATCCGCTT GCCCTCTACA T 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CACTACTCGA TCCTCAAGCA G 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



CTGCTTGAGG ATC GAGTAGT G 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATTGCTGTTG TCGGGCAGCA G 

(2) INFORMATION FOR SEQ ID NO: 10: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGATCCGCTT GCCCTCTACA TTCTGCTGCC CGACAACAGC AAT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATTGCTGTTG TCGGGCAGCA GA 
(2) INFORMATION FOR SEQ ID NO: 12: 



(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGATCCGCTT GCCCTCTACA TTCTGCTGCC CGACAACAGC AAT 43 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 



(i) 



SEQUENCE CHARACTERISTICS : 



(C) 
(D) 



STRANDEDNESS : double 
TOPOLOGY : linear 
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(ii) 
(xi) 



MOLECULE TYPE: DNA (genomic) 
SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ATTGCTGTTG TCGGGCAGCA GG 



22 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGATCCGCTT GCCCTCTACA TCCTGCTGCC CGACAACAGC AAT 43 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATTGCTGTTG TCGGGCAGCA GAATGTAGAG GGCAAGCGGA TCC 43 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 16: 

GGATCCGCTT GCCCTCTACA T 21 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATTGCTGTTG TCGGGCAGCA GAATGTAGAG GGCAAGCGGA TCC • 43 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGATCCGCTT GCCCTCTACA TT 22 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATTGCTGTTG TCGGGCAGCA GGATGTAGAG GGCAAGCGGA TCC 43 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGATCCGCTT GCCCTCTACA TC 22 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE.: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: * linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGATCCGCTT GCCCTCTACA TTCTGCTGCC CGACAACAGC AAT 43 

•(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) "TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGATCCGCTT GCCCTCTACA TTTTCTGCTG CCCGACAACA GCAAT 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TGCTGTTGTC GGGCAGCAGA AT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24; 

GGATCCGCTT GCCCTCTACA TTCTGCTGCC CGACAACAGC AAT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TGCTGTTGTC GGGCAGCAGA AA 22 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GGATCCGCTT GCCCTCTACA TTTTCTGCTG CCCGACAACA GCAAT 45 

(2 ) INFORMATION. FOR SEQ ID NO : 27 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

CAGACATTTG AGAATCAAGT GAATCGTATT CTCAATGATG CTCGAGACAA AACTGGCTCC 60 

TCTGCACAGA AATCCCTCTC TGAATATAAC AACTTCAAGT CTATGGTGGT GTCTGGAGCC 120 

AAGGGTTCCA AGATCAACAT CTCCCAGGCA AGATGCTTCA TTTTCCAGAT ATGTGGCCTA 180 

TACCAGAGTT TGTAAAGAGG ATGGTATGTA CATGTTTTGG TGTGAGGAAA GATGGAAAAA 240 

ATAGTAGGGA ATTGTCACCA CCACCACCAC TGCTGCAGTG TCATGGCTTG AAACAAGATT 300 

CACTCACGTG TAAAAGACCT TTTTTAAAAC AAAACAAAAC ATGGTTTTGC TGTGTAGCCC 360 
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AGGTTGAGTG TGAACTTTGT ATCTTTCTGC CTCCTCTTTC CAACTTTAGG TTTCAGGCAT 420 
GCACTATTTC TGCCATAAAT TCATACTTTT AATGCTAGGG GAAATCATAT GCAGCCTTTC 480 
CCCCCCCTTA GGTCATTGCT GTTGTCGGGC AGCAGAATGT AGAGGGCAAG CGGATCCCAT 540 
TTGGATTCAA GCATCGGACT CTTCCTCACT TTATCAAGGA TGATTATGGT CCTGAGAGCC 600- 
GAG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 
, (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

CTCGGCTCTC AGGACCAXAA TCATCCTTGA TAAAGTGAGG AAGAGTCCGA TGCTTGAATC 60 

CAAATGGGAT CCGCTTGCCC TCTACATTCT GCTGCCCGAC AACAGCAATG ACCTAAGGGG 120 

GGGGAAAGGC TGCATATGAT TTCCCCTAGC ATTAAAAGTA TGAATTTATG GCAGAAATAG 180 

TGCATGCCTG AAACCTAAAG TTGGAAAGAG GAGGCAGAAA GATACAAAGT TCACACTCAA 240 

CCTGGGCTAC ACAGCAAAAC CATGTTTTGT TTTGTTTTAA AAAAGGTCTT TTACACGTGA 300 

GTGAATCTTG TTTCAAGCCA TGACACTGCA GCAGTGGTGG TGGTGGTGAC AATTCCCTAC 360 

TATTTTTTCC ATCTTTCCTC ACACCAAAAC ATGTACATAC CATCCTCTTT ACAAACTCTG 420 

GTATAGGCCA CATATCTGGA AAATGAAGCA TCTTGCCTGG GAGATGTTGA TCTTGGAACC 480 
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CTTGGCTCCA GACACCACCA TAGACTTGAA GTTGTTATAT TCAGAGAGGG ATTTCTGTGC 540 

AGAGGAGCCA GTTTTGTCTC GAGCATCATT GAGAATACGA TTCAC TTGAT TCTCAAATGT 600 

CT( 3 603 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CCCCCCCTTA GGTCATTGCT GTTGTCGGGC AGCAGAATGT AGAGGGCAAG CGGATCCCAT 60 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATGGGATCCG CTTGCCCTCT ACATTCTGCT GCCCGACAAC AGCAATGACC TAAGGGGGGG 60 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCCCCCCTTA GGTCATTGCT GTTGTCGGGC AGCAGGATGT AGAGGGCAAG CGGATCCCAT 60 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATGGGATCCG CTTGCCCTCT ACATCCTGCT GCCCGACAAC AGCAATGACC TAAGGGGGGG 60 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D ) TOP OLOG Y : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTCGTGCGAG GAGCAGACGT AGCGCCCGGC CGCGGCCGGG TTCTCGAAGA GGAAGATCTC 60 

GGCGTCGCAG AGGTCGTCGA GGTGGATGAG CTGCACCTGC TTGAGGATCG AGTAGTGCGG 120 

CGCGTTCCCC GTGATGAGCG CCAGCGCGGT GATGAGGCTG GGCGGCATGG ACGCGCTGAT 180 

GAACGGGCCG ACCACGAGCG TCGGGATGAT GGTGACCAGG TCCAGGCCGT GCXCCGCCGC 240 

GTACGCCAGG GCCGCCTTCT CCGCCAGGGT TTTAGACACG AAGTACATCT GCAGG 295 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CCTGCAGATG TACTTCGTGT CTAAAACCCT GGCGGAGAAG GCGGCCCTGG CGTACGCGGC 60 

GGAGCACGGC CTGGACCTGG TCACCATCAT CCCGACGCTC GTGGTCGGCC CGTTCATCAG 120 

CGCGTCCATG CCGCCCAGCC TCATCACCGC GCTGGCGCTC ATCACGGGGA ACGCGCCGCA 180 

CTACTCGATC CTCAAGCAGG TGCAGCTCAT CCACCTCGAC GACCTCTGCG ACGCCGAGAT 240 

CTTCCTCTTC GAGAACCCGG CCGCGGCCGG GCGCTACGTC TGCTCCTCGC ACGAC 295 
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What is claimed is : 

1. A method of identifying a nucleotide of interest 
5 present at a defined position in a nucleic acid analyte, 
comprising: 

a) contacting the nucleic acid analyte with a 
probe such that annealing takes place adjacent to the 
nucleotide of interest to form a hybrid; 
10 b) contacting the hybrid with four chain 

terminating nucleotides; 

c) extending the probe in the direction of the 
nucleotide of interest by addition of the chain 
terminating nucleotide complementary to the nucleotide 

15 of interest; 

d) determining which chain terminating 
nucleotide was added; and 

e) identifying the nucleotide of interest as 
the nucleotide complementary to the chain terminating 

20 nucleotide which was added, 

2 . The method of Claim 1 wherein the nucleic acid 
analyte is a sequence of DNA or RNA. 

25 3. The method of Claim 1 wherein the probe is a 

sequence of DNA or RNA. 

4 . The method of Claim 1 wherein the chain 
terminating nucleotides are dideoxynucleotides . 

30 

5. The method of Claim 1 wherein the probe is 
extended chemically. 
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6. The method of Claim 1 wherein the probe is 
extended enzymatically . 

7. The method of Claim 1 wherein the added chain 
5 terminating nucleotide is determined by detecting the 

presence of a signal generator. 

8. The method of Claim 1 wherein the nucleic acid 
analyte is single stranded. 

10 

9. The method of Claim 1 wherein the nucleic acid 
analyte is immobilized on a solid support . 

10. The method of Claim 1 wherein the probe is 
15 immobilized on a solid support. 

11. A kit for identification of a nucleotide of 
interest in a nucleic acid analyte, comprising: 

a) a probe which comprises a primer sequence 

20 complementary to the nucleic acid analyte and capable of 
binding the nucleic acid analyte with sufficient 
specificity to form a stable hybrid adjacent to the 
nucleotide of interest; 

b) a plurality of reporter labeled chain 
25 terminating nucleotide triphosphates; and 

c) a primer-dependent nucleic acid polymerase . 

12. A method of identifying a nucleotide of interest 
present at a defined position in a nucleic acid analyte, 

30 comprising: 

a) contacting the nucleic acid analyte with a 
probe such that annealing takes place adjacent to the 
nucleotide of interest; 
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b) contacting the hybrid with at least one 
chain terminating nucleotide; 

c) extending the probe in the direction of the 
nucleotide of interest by addition of the chain 

5 terminating nucleotide complementary to the nucleotide 
of interest; 

d) determining which chain terminating 

nucleotide was added; and 

e> identifying the nucleotide of interest as 
10 the nucleotide complementary to the chain terminating 
nucleotide which was added. 

13. The method of Claim 12 wherein the nucleic acid 
analyte is a sequence of DNA or RNA. 

15 

14. The method of Claim 12 wherein the probe is a 
sequence of DNA or RNA. 

15. The method of Claim 12 wherein the chain 
20 terminating nucleotides are dideoxynucleotides • 

16. The method of Claim 12 wherein the probe is 
extended chemically. 

25 17. The method of Claim 12 wherein the probe is 

extended enzymatically . 

18. The method of Claim 12 wherein the added chain 
terminating nucleotide is determined by detecting the 
30 presence of a signal generator. 



19. The method of Claim 12 wherein the nucleic acid 
analyte is single stranded. 
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20. The method of Claim 12 wherein the nucleic acid 
analyte is immobilized on a solid support . 

21. The method of Claim 12 wherein the probe is 
5 immobilized on a solid support . 

22. A kit for identification of a nucleotide of 
interest in a nucleic acid analyte, comprising: 

a) a probe which comprises a primer sequence 

10 complementary to the nucleic acid analyte and capable of 
binding the nucleic acid analyte with sufficient 
specificity to form a stable hybrid adjacent to the 
nucleotide of interest; 

b) at least one of reporter labeled chain 
15 terminating nucleotide triphosphates; and 

c) a primer-dependent nucleic acid polymerase. 



20 



25 
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